منابع مشابه
Hyperbolically Discounted Temporal Difference Learning
Hyperbolic discounting of future outcomes is widely observed to underlie choice behavior in animals. Additionally, recent studies (Kobayashi & Schultz, 2008) have reported that hyperbolic discounting is observed even in neural systems underlying choice. However, the most prevalent models of temporal discounting, such as temporal difference learning, assume that future outcomes are discounted ex...
متن کاملTwo parameter-tuned meta-heuristics for a discounted inventory control problem in a fuzzy environment
In this paper, a nearly real-world multi-product, multi-period inventory control problem under budget constraint is investigated, where shortages in combination with backorders and lost sales are considered for each product. The ordered quantities of products are delivered in batch sizes with a known number of boxes, each containing a pre-specified number of products. Some products are purchase...
متن کاملLearning Rates for Q-Learning
In this paper we derive convergence rates for Q-learning. We show an interesting relationship between the convergence rate and the learning rate used in Q-learning. For a polynomial learning rate, one which is 1/t at time t where ω ∈ (1/2, 1), we show that the convergence rate is polynomial in 1/(1− γ), where γ is the discount factor. In contrast we show that for a linear learning rate, one whi...
متن کاملQ-learning for Robots
Robot learning is a challenging – and somewhat unique – research domain. If a robot behavior is defined as a mapping between situations that occurred in the real world and actions to be accomplished, then the supervised learning of a robot behavior requires a set of representative examples (situation, desired action). In order to be able to gather such learning base, the human operator must hav...
متن کاملP14: Anxiety Control Using Q-Learning
Anxiety disorders are the most common reasons for referring to specialized clinics. If the response to stress changed, anxiety can be greatly controlled. The most obvious effect of stress occurs on circulatory system especially through sweating. the electrical conductivity of skin or in other words Galvanic Skin Response (GSR) which is dependent on stress level is used; beside this parameter pe...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of Japan Society for Fuzzy Theory and Intelligent Informatics
سال: 2014
ISSN: 1347-7986,1881-7203
DOI: 10.3156/jsoft.26.913